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METHOD AND SYSTEM FOR STAGGERED 
STATISTICAL MULTIPLEXING 

CROSS REFERENCE TO RELATED APPLICATION 

This application claims the benefit of US Provisional Application No. 60/426,646, 
filed on November 15, 2002, entitled METHOD AND SYSTEM FOR STAGGERED 
STATISTICAL MULTIPLEXING, which is incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

The invention relates generally to coding/compression of digital video signals, and 
more particularly to improved bandwidth utilization during concurrent transmission of a 
plurality of video signals. 

BACKGROUND OF THE INVENTION 

Video signals are inherently highly complex and place substantial resource demands 
on the transmission medium arranged to transmit such signals - both bandwidth and 
transmission power. Because bandwidth (and often power, as well) is typically a constrained 
resource, various techniques have been developed to compress the video signals prior to 
transmission, and thereby reducing the required transmission bandwidth. These techniques 
typically first digitize the video signal in order to improve the efficiency of the compression 
process, as well as other transmission processes including error correction. 

A widely used methodology for compression and coding of digital video signals is 
MPEG-2, as described and codified in ISO 13818-2. Indeed, the MPEG-2 methodology is an 
integral part of the Advanced Television System standard as adopted by the Advanced 
Television Systems Committee (ATSC). MPEG-2 is also widely used for transmission and/or 
recording of NTSC (National Television Standards Committee) based material, particularly 
cable and satellite transmission and DVD recordings. A newer member of the MPEG family 
of coding/compression algorithms, designated MPEG-4, offers significantly improved 
bandwidth compression, but has not yet been adopted for use with ATSC or NTSC television. 

In the case where a plurality of substantially non-correlated video signals are 
transmitted simultaneously via a common transmission medium, such as broadcast television 
or CATV channels, a further reduction in the required transmission bandwidth can be 
achieved by use of a technique known as statistical multiplexing. The essential idea of 
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statistical multiplexing (hereafter usually abbreviated as "stat mux") is to dynamically spread 
the total transmission bandwidth among the plurality of signals, or channels. For example, at 
a given time, channel 3 may require a higher number of bits/pixel (due to a complex scene) 
when channel 5 does not need as many. Therefore, a statistical multiplexer allocates more bits 
to channel 3 than it does to channel 5 (stealing bits from channel 5). This situation is always 
changing and therefore the controller for the statistical multiplexer must be designed to react 
quickly to changes in video complexity on a given channel. 

Video complexity is very bursty and stat mux can enable a broadcaster to achieve high 
quality during the high-complexity video segments on a given channel. Even more valuable, 
a broadcaster can fit more video channels into the transmission bandwidth. That is, by taking 
advantage of the average video complexity being low on most channels, extra space is made 
to increase the number of video channels. 

The determination of which channel should receive the most bits is difficult, however, 
especially in a real time encoding system for broadcast. Most known solutions take a 
complexity measure from the input video frames on each channel, and then allocate a fixed 
number of bits for use on the channel over the next group of pictures (normally 12 or 15 video 
frames) or sometimes on the next picture. Because these systems are not frame synchronized 
(and even if they were), at any given stat mux allocation time, the type of frames requesting 
bits is a variable. 

A known statistical multiplexing system developed by the assignee of this application, 
Thomson Consumer Electronics, for a satellite television service, operates in a standard 
definition MPEG2 system with distributed hardware video encoders. Implementing stat mux 
on this system is difficult because each MPEG2 video encoder has its own rate control 
system, and stat mux must be layered on top of it. Bit allocations are at the GOP level only, 
and no frame level synchronization is possible. 

SUMMARY OF INVENTION 

The invention is a method and apparatus for us in a multi-channel video 
transmission system in which channel video segments are operated on by corresponding 
channel video encoders to encode the video segments into a plurality of frames organized into 
frame types having defined frame patterns. The frame transmission alignment is then 
arranged among a plurality of channels concurrently transmitted via a common transmission 
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medium. A specified frame type is arranged for transmission in each channel so as to avoid 
temporal alignment with other ones of the same specified frame type in other channels. 

In effect, the invention synchronizes multiple video transmission and encoding 
channels and then staggers the compressed frame types to distribute the bit allocations evenly. 

5 In one preferred embodiment, based on use of MPEG coding/compression of the video 

signals, the Intra-coded (I) frames are temporally displaced so as to avoid a temporal 
alignment of I frames among a selected group of channels sharing a common transmission 
medium. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 is a block diagram of a broadcast video transmission system incorporating 

statistical multiplexing, in accordance with the principles of the present invention; 

Figure 2 is an illustration of display order versus encoding order for a sequence of 
MPEG-encoded video frames in a single channel, in accordance with the principles of the 
present invention; 

15 Figure 3 is an illustration of synchronized, encoded video frames for a 4 channel video 

broadcast system, in accordance with the principles of the present invention; 

Figure 4 is an illustration of synchronized, encoded video frames for a 4 channel video 
broadcast system using staggered I frames, in accordance with the principles of the present 
invention; 

20 Figure 5 is an illustration of a video broadcast system using staggered I frames and 

having 9 video channels, in accordance with the principles of the present invention; 

Figure 6 is an illustrative GOP structure for implementing the invention in a multi- 
channel video broadcast system, in accordance with the principles of the present invention; 
and 

25 Figure 7 is a block diagram of an exemplary implementation of the invention in a 

video broadcast system. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention is directed to an improved method for providing statistical multiplexing 
in a transmission medium carrying a plurality of substantially non-correlated digital signals, 
30 and will be described hereafter in terms of a preferred embodiment - video signals transmitted 
via a broadcast video system or a CATV or satellite system. While that preferred embodiment 
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is premised on a staggering of the Intra coded frames (I frames) developed pursuant to the 
MPEG coding methodology, it should be understood that the methodology of the invention is 
intended to be applicable to the staggering of independently encoded frames in frame 
sequences of alternate coding/compression methodologies. Indeed, the method of the 
5 invention should be understood as being applicable to the provision of staggered statistical 
multiplexing for any plurality of substantially non-correlated signals transmitted via a 
common transmission medium where an identifiable independently-encoded frame is 
determined for given segments of the plurality of signals. 

A block diagram of a broadcast video transmission system is shown in Figure 1, the 
10 system being composed of N input video channels. Each video channel is frame synchronized 
(by Frame Sync 101) to a common video clock (to establish a frame period, among other 
timing functions). Therefore, all of the Video Encoders 103 are locked to the same video 
frame rate. A complexity measure is created from each encoder and delivered to the Stat Mux 
Controller 105. The Stat Mux Controller allocates the total available bit rate among the N 
15 Encoders 103. Based on signals from the Stat Mux Controller, statistical multiplexing is 

applied to the N video channels by Mux 107, with the output of Mux 107 provided as an input 
to Modulator 109. In the instant exemplary embodiment, the stat mux algorithm allocates 
new bit budgets for each channel for each frame period. 

The MPEG video coding/compression standard organizes the video frames into 3 
20 frame types: I, P, and B frames (Intra, Predictive, and Bi-directionally Predictive). I frames 
use no motion estimation from previous frames. P frames use the previous anchor frame for 
motion estimation, and B frames use the previous and future anchor frames for motion 
estimation. In general, I frames are the largest, in terms of compressed bits per pixel, 
followed by P frames, and then B frames. The relative sizes are unimportant for the 
25 description of the invention following. 

The distance between I frames (which corresponds to the length of the Group of 
Pictures, or GOP - Le. f there is one and only one I frame per GOP), distance between anchor 
frames and other aspects of the GOP are not defined in the compression standards. Therefore, 
an implementer must decide how to organize the GOP structure. For purposes of discussion, a 
30 GOP size of 9 frames and a distance between anchor frames of 3 will be assumed. (In the 
shorthand vernacular of the art, this is known as N=9, M=3). However, it should be 
understood that the described methodology of the invention is applicable to any type of GOP 
structure. 
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As is well known, the ordering of frame types in a GOP differs between the display 
order and the encoding order. While the I frame in the display order may well fall somewhere 
in the middle of the GOP, the frames are re-ordered during encoding since the B frames 
require an anchor frame in the future for encoding. This reordering is illustrated in Figure 2, 
5 which shows frame types and sizes versus time for an illustrative GOP in display order and 
encoding order (also known as transmission order). 

Figure 3 shows a GOP sequence for each channel of an illustrative 4 channel video 
broadcast system with synchronized frames. As is apparent from the figure, the I frames for 
all of the channels appear at the same time point, and thus the stat mux controller must 

10 allocate all the I frame bits simultaneously. This is a disadvantage for two reasons: 

1. The Intra frame is the most important frame in the GOP sequence since it is the first 
anchor frame for the GOP. That is, all the frames in the GOP have dependencies on 
the Intra frame. This creates considerable difficulty for the stat mux algorithm since 
all channels would benefit from high quality; and 

15 2. Nothing is known about the future frames {e.g., whether complexity will decrease or 

increase), and therefore only a uniform distribution of bits can be made. (While static 
characteristics for the program material on individual channels - e.g., sports, movies, 
etc. - may be available to the stat mux in order to provide a rudimentary level of 
complexity prediction, that is in addition to any stat mux algorithm). Future frame 

20 information could be created, but only by buffering up these frames, which is an 

expensive solution. 

GOP sequences for a similar 4 channel video broadcast system with synchronized 
frame periods is shown in Figure 4, but here the I frames have been staggered, in accordance 
with the principles of the present invention. Note that the frame sequences depicted by solid 

25 lines in Figure 4 are intended to represent the GOPs for each channel at start up of the frame 
staggering method of the present invention. However, to also illustrate the repeating pattern 
of the GOPs over time, the frames that would appear in each frame position preceding the I 
frame for each channel in subsequent processing intervals are shown in the figure by dotted 
lines. With the I frames staggered as illustrated in the figure, the stat mux controller is now 

30 given much more flexibility when allocating bits for each frame period. When allocating bits 
for the first frame period, it can give priority to the channel 1 Intra frame, then channel 4*s P 
frame, then channel's 2 and 3 B frames. If not enough bits are available for this time slot, the 
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B frames can be degraded with minor negative results (since the B frames are not anchor 
frames). 

Another advantage of staggered I frames, in accordance with the principles of the 
present invention, is that the frames farther away from the I-frame can also be degraded more 
5 than ones close to the I-frame. This is due to P frames being predicted from the previous P 
frame. Therefore, the farther away a frame is from the initial I-frame, the less effect a 
degradation of that frame will have on future pictures. 

One particularly important advantage of the frame stagger methodology of the 
invention is that it avoids the need for the overall stat mux algorithm to use a complexity 

10 measure in determining the allocation of bits among the channels. As will be well understood 
by those of skill in the art, stat mux algorithms typically measure GOP complexity and then 
assign a number of bits per GOP per channel. By staggering the high priority frames in the 
multi-channel GOPs, in accordance with the principles of the present invention, the need for 
that complexity measure is eliminated. It should also be understood in this regard that the 

15 staggering methodology of the present invention could otherwise be combined with the 

known stat mux algorithms (along with removal of the complexity measure functionality). It 
should be apparent, as well, that static constraints — such as giving a sports channel more bits 
than a movie channel — can still be used with the present invention. 

Yet another benefit of staggering is that the average bit budget per frame period is 
20 more uniform. That is, there is not a large I-frame spike as in the non-staggered case. A more 
uniform bit allocation per frame period helps the channel multiplexer achieve a low delay 
between the video compression encoders and the modulation, and ultimately the receiver. 
Another way to view this advantage is that the multiplexer does not need as much buffering to 
smooth out the peak bit rates coming from the encoders. 

25 The frame staggering of the present invention is hereafter described for a broadcast 

system in which there are many video channels to multiplex. As will be seen, when the 
number of channels is large, the overall system can be organized such that a uniform bit 
allocation method can be used. 

Initially, it should be observed that the optimum staggering order is not a simple I- 
30 frame time stagger. Rather, the optimum stagger order is achieved by keeping the distance 
between I-frames at a maximum on average. Of course, the number of video channels affects 
the actual spacing. In the case of 4 video channels illustrated in Figure 4, some extra space (2 
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frame periods) is created by alternating the spacing. In the case of 9 video channels (and a 
GOP of 9), then the spacing between I frames is only 1 frame period. 

To further illustrate the principles of the present invention, a frame staggering 
arrangement for a system with 9 video channels and a 9 frame GOP length is depicted in 
Figure 5. With the illustrated channel/GOP symmetry, the staggering order does not matter 
since the distance between I frames is 1 for all channels. Therefore the total bit allocation for 
each frame time is uniform. With such an arrangement, the staggering/bit-allocation 
algorithm is straightforward. For a given frame time, the channel with the I frame gets a pre- 
determined bit allocation (the highest priority), then the channels with the P frames get a 
hierarchical bit allocation based on their distance from the I frame, and then the same 
procedure for the B-frames as the lowest priority frames. As can readily be seen, this 
algorithm could be implemented with a look up table in the stat mux controller. 

The uniform I frame distribution (i.e., each successive channel having the I-frame 
shifted by one frame from the frame position of the I-frame in the prior channel) shown for 
the 9 channel case of Figure 5 can be extended to larger systems by selective manipulation of 
the GOP structure. Such an optimization of the GOP structure in order to create a uniform 
distribution of I frames is possible because GOP structures are not very restrictive - frame 
ordering information is in the bitstream such that a decoder can determine the correct order. 
Therefore some manipulation of the GOP structure can be done in order to achieve a uniform 
frame distribution system. Although a number of GOP structures can be determined to 
accommodate such a uniform frame distribution, and will be readily derived by those skilled 
in the art, the following methodology provides one particular such structure and may be 
regarded as a preferred embodiment of the invention. 

1 ) The number of frames in the sequence to be operated on by the stat 
mux of the invention should equal the number of channels, or an integral multiple 
thereof. 

2) An individual GOP length is not typically longer than 0.5 - 1 second 
for broadcast environments. This could be longer for non-broadcast environments 
(e.g., internet downloads). 

3) A GOP structure with 2 B frames between anchor frames is normally 
desired. However this results in a significant constraint on the possible GOP lengths 
available to satisfy (1) and (2) above. Therefore, to ameliorate this constraint, one 
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can shorten any B frame section of a GOP with only minimal loss of compression 
efficiency. 

An illustrative and exemplary set of GOPs developed according the described 
methodology for video broadcast systems operating with between 9 and 30 channels is shown 
5 in the tabulation of Figure 6. In order to satisfy the constraint on individual GOP length, it 
may be necessary (in the case of higher numbers of channels) that the frame sequence 
operated on by the stat mux of the invention be comprised or two or more GOPs. In that 
circumstance, the stat mux may, of course, process all of the GOPs comprising the frame 
sequence in one operation. It is noted that, where warranted for overall GOP length, the first 
10 B frame pattern in the GOP is shortened. However, it should be understood that any other 
sequence of B frames in the GOP may be selected for shortening. 

The essential restriction in the derivation of such a multi-channel GOP structure, as 
noted above, is to keep the number of frames in the GOP structure(s) equal to the number of 
video channels (or an integer multiple of them). 

15 It should be understood that there are other options for defining the GOP structure to 

carry out the frame staggering method of the invention. For example, in the case of 30 
channels, 3 GOPs of 10 frames each could be used - (e.g., IPBBPBBPBB, IPBBPBBPBB, 
IPBBPBBPBB). The tradeoff is that I frame intervals are shorter, but there is higher I frame 
overlap for processing by the staggered stat mux algorithm. 

20 In any alternative GOP structure, the established trade-offs for GOP structure design 

should still be considered. In particular, the shorter the GOP, the less efficient the 
compression process will be, since I frames are the largest frames. However, short GOPs also 
offer advantages since they allow shorter channel change times and reduced encoder drift 
periods. 

25 A system architecture to carry out the staggered frame stat mux of the invention can be 

implemented using simple and well defined elements. Moreover, that system architecture 
reduces design complexity when compared to a more traditional stat mux system. Some of 
the advantages for the invention may include: 

• A staggered stat mux broadcast system can be used without the need for real time 
30 complexity measures from the encoders. This simplifies the encoders and the stat mux 

controller. 
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• A simple bit allocation methodology can be used that is optimized for the broadcast 
system. That is, the same bit allocation values could be used for every frame time - 
the Intra frames getting the larger values, followed by P frames and B frames. 

The system architecture includes a frame synchronizer, which is normally provided in 
real time encoding systems in order that the encoder synchronizes itself with the incoming 
analog video. Various designs are known and may be used. Since all encoders are frame 
synchronized, in accordance with the principles of the present invention, the specific 
technique used does not matter as long as the frame periods are the same phase and period. 

In order to stagger the I-frames (and succeeding frames in a GOP) for the channel 
encoders, a simple register delay line may be implemented that delays the encoder reset signal 
by the frame rate clock. However, this will result in a shared reset for all encoders. Such an 
approach could be problematic in a multi-channel broadcast system because, in practice, an 
encoder must occasionally be upgraded or reset, but the service provider will not want to reset 
all the channels simultaneously each time this operation is carried out. 

Accordingly, for a preferred embodiment of the invention, the frame-stagger approach 
is to create a frame rate counter and a register based comparator for each encoder. The output 
of the comparator gives the correct phase for the reset for each encoder. In this way, each 
encoder can be individually reset. An exemplary architecture for implementing this preferred 
approach is schematically depicted in Figure 7. With reference to the figure, Frame Rate 
Counter 701 is driven by a frame rate clock and would normally be implemented as a digital 
counter. The Frame Rate Counter operates to synchronize the reset signals for the Video 
Encoders 709. Each encoder will be reset on a different frame boundary in order to start its 
sequence at the appropriate time to enable the staggered I frame start time, in accordance with 
the principles of the present invention. To achieve that end, each Encoder Phase Register 703 
is loaded with a frame offset for its corresponding video encoder. The timing diagram of the 
figure shows the frame offsets of the phase registers for illustrative phases of 4,7,10, and 13. 
The Comparator 705, which receives as inputs the outputs of the frame rate counter and the 
appropriate encoder phase register, provides an output that is active high when the frame rate 
counter value and the phase register offsets are equal. This comparator output signal is then 
gated by the reset signal (shown in the timing diagram) with AND Gate 707. Therefore, when 
an encoder reset signal, at output of the corresponding AND gate, is driven high, the encoder 
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will be reset on its appropriate phase at the frame offset established for that encoder in 
accordance with the frame stagger principles of the present invention. 

It is to be noted that the architecture depicted in Figure 7 is directed only to illustrate 
the inventive principles of the present invention. To place that architecture into the context of 
5 an overall stat mux system, such as depicted in Figure 7, it should be understood that frame 
stagger implemented by this arrangement eliminates the need for complexity and bit 
allocation signals and, as well, replaces the stat mux controller and frame sync blocks of the 
system shown in Figure 1. Stated somewhat differently, the frame stagger circuitry of Figure 
7 takes the place of the stat mux controller and frame sync blocks of Figure 1 in an overall stat 
10 mux system implementation. 

As new channels are added or deleted from the system, a new GOP structure and 
frame staggering order can be adopted easily by loading new reset phases into the comparator 
registers. 

It is to be emphasized that, with the staggered frame stat mux methodology of the 
15 present invention, the stat mux controller need only consider static channel characteristics and 
can ignore dynamic complexity measures. The system can of course be adapted to also use 
such complexity measures, and the bit allocation efficiency may be even further improved, but 
the principles of the present invention will deliver an effective stat mux capability with 
considerable simplification from systems that must rely on complexity measures. 

20 Numerous modifications and alternative embodiments of the invention will be 

apparent to those skilled in the art in view of the foregoing description. In particular, the 
application of the methodology of the invention to other or additional GOP structure 
arrangements is intended to be within the contemplation of the invention. It is also 
contemplated that the principles of the present invention will be applied in systems operating 

25 under alternate coding/compression standards, including MPEG-4, H.26L, and proprietary 
systems (Windows Media Player™, Reed Player™, etc). 

Accordingly, this description is to be construed as illustrative only and is for the 
purpose of teaching those skilled in the art the best mode of carrying out the invention and is 
not intended to illustrate all possible forms thereof. It is also understood that the words used 
30 are words of description, rather that limitation, and that details of the structure may be varied 
substantially without departing from the spirit of the invention and the exclusive use of all 
modifications which come within the scope of the appended claims is reserved. 



